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Abstract. We study a class of controlled rough differential equations. It is shown that the value 
function satisfies a HJB type equation; we also establish a form of the Pontryagin maximum 
principle. Deterministic problems of this type arise in the duality theory for controlled diffusion 
processes and typically involve anticipating stochastic analysis. We propose a formulation based 
on rough paths and then obtain a generalization of Roger's duality formula [L. C. G. Rogers, 
Pathwise Stochastic Optimal Control. SIAM J. Control Optim. 46, 3, 1116-1132, 2007] from 
discrete to continuous time. We also make the link to old work of [M. H. A. Davis and G. 
Burstein, A deterministic approach to stochastic optimal control with application to anticipative 
optimal control. Stochastics and Stochastics Reports, 40:203-256, 1992], 



1. Introduction 

In classical works [11, 29] Doss and Sussmann studied the link between ordinary and stochastic 
differential equations (ODEs and SDEs, in the sequel). In the simplest setting, consider a nice 
vector field a and a smooth path B : [0,T] — » R, one solves the (random) ordinary differential 
equation 

X = a(X)B, 

so that Xt = e aBt Xo, where e aBt denotes flow, for unit time, along the vector field a (■) Bt- The 
point is that the resulting formula for X t makes sense for any continuous path B, and in fact the 
Ito-map B i — y X is continuous with respect to || 'Hoo-ro ri- particular, one can use this procedure 
for every (continuous) Brownian path; the so-constructed SDE solution then solves the Stratonovich 
equation, 

dX = a {X) o dB = a {X) dB + i (aa') (X) dt. 

When B = B (u) : [0, T] —> R d is a multidimensional Brownian motion, which we shall assume 
from here on, this construction fails and indeed the Ito-map is notorious for its lack of (classical) 
continuity properties. Nonetheless, many approximations, B n — > B, examples of which include 
the piecewise linear -, mollifier - and Karhunen-Love approximations, have the property that the 
corresponding (random) ODE solutions, say to dX n = b (X n ) dt+a (X n )dB n wither = {a\, . . . , ad), 
convergence to the solution of the Stratonovich equation 

dX = b {X) dt + a (X) o dB. 
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(At least in the case of piecewise linear approximations, this result is known as Wong-Zakai theo- 
rem 1 ) It was a major step forward, due to T. Lyons [23], to realize that the multidimensional SDE 
case can also be understood via deterministic differential equations (known as rough differential 
equations); they do, however, require more complicated driving signals (known as rough paths) 
which in the present context are of the form 

B(w) : [0,T] -^R d ®so{d), 

and "contain", in addition to the Brownian sample path, Levy stochastic area, viewed as process 
with values in so(d). Among the many applications of rough paths to stochastic analysis (eg. [14] 
and the references therein) let us just mention here that (i) all Wong-Zakai type results follow from 
B n —> B in a rough path metric (ii) the (rough)pathwise resolution of SDE can handle immediately 
situations with anticipating randomness in the coefficients; consistency with anticipating SDE in 
the sense of Ocone-Pardoux [26] was established in [7]. 

The purpose of this paper is to explore the interplay of rough paths with control problems; 
more specifically, in the context of controlled differential equations with usual aim of maximizing a 
certain payoff function over a fixed time horizon [0, T]. As is well known, the dynamic programming 
principle, in the context of ODEs, leads to Hamilton-Jacobi (HJ) equations for the value function, 
i.e. non-linear first order partial differential equations, for the value function. Optimal control can 
be characterized with the aid of the Pontryagin maximum principle (PMP) which essentially is the 
method of characteristics applied to the HJ equations. As will be discussed in detail in Section 3 
of this paper, all this can be done for general (i.e. deterministic) rough differential equations, say 
of the form 

dX = b (X, n)dt + cr (X) dri, 

where r\ is a rough path and /i = (fj, t ) a control. The value function then solves a so-called 
rough viscosity equation; as introduced in [6, 9]; see also the pathwise stochastic control problems 
proposed by Lions-Souganidis [21], further studied by Buckdahn-Ma [4[. One can, of course, apply 
this (rough) pathwise to SDEs, just take r] = B (w); however, the (optimal) control [i = /i t (uS) will 
depend anticipatingly on the Brownian driver. Moreover, the w-wise optimization has (at first 
glance) little to with the classical stochastic control problem in which one maximizes the expected 
value, i.e. an average over all w's, of a payoff function. 

Making the link between deterministic and such classical stochastic control problem is the purpose 
of Section 4. In a discrete time setting, Wets [30] first observed that stochastic optimization 
problems resemble deterministic optimization problems up to the nonanticipativity restriction on 
the choice of the control policy. The continuous time setting, i.e. studying the link between 
controlled ordinary and controlled stochastic differential equations, goes back to Davis-Burstein 
[8] . The (meta) theorem here is actually a duality of the form 



sup [... + P*) 



(Dl) : supE[...] =E 
for a suitable penality P*, or more generally 

sup [... +P] 



(D2) : supE[...] = inf E 



where v denotes an adapted control, \i a possibly anticipating control, the dots . . . may stand for a 
payoff such as g (Xt)- Note that (D2) has an immediate practical advantage: any choice of P gives 



although, strictly speaking, the multi-dimensional case is due to M. Clark and Stroock-Varadhan. 
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an upper bound ("duality bound") on the value function and therefore complements lower bounds 
obtained from picking a particular strategy v. 

In the context of continuous time optimal stopping problems, (Dl) was established by Davis- 
Karatzas [17], (D2) is due to Rogers [28], see also [16], with important applications to the (Monte 
Carlo) pricing of American options. The extension of (D2), with concrete martingale penality 
terms, to general control problem in discrete time only was carried out by Rogers in [27], see also 
[3]. 

While the discrete time setting of Rogers apriori avoids all technicalities related to measurability 
(e.g. sup^ vs. esssup„, the meaning of X controlled by anticipating (j,* ...) such things obviously 
matter in the work of Davis-Burstein. And indeed, the Ocone-Pardoux techniques (notably the 
representation of anticipating SDE solutions via flow decomposition) play a key role in their work. 

Our contribution in section 4 is then twofold: 

(I) . We give a "general" duality result in continuous time and see how it can be specialized to yield 
a version of Roger's duality (D2) for control problems in continous time. Another specialization 
leads to the Davis-Burstein result, which we review in this context. (Both approaches are then 
compared explicitly via computations in LQC problems.) 

(II) We make the case that rough path analysis is ideal to formulate and analyze these problems. 
Indeed, it allows to write down, in a meaningful and direct way, all the quantities that one wants 
to write down - without any headache related to afore-mentioned (measurability /anticipativity) 
technicalities : throughout, all quantities depend continuously on some abstract rough path r\ and 
are then - trivially - measurable upon substituion r\ <— B [uS). 

2. Notation 

For a > | denote by C a = C°> a (E) the space of geometric, a-H61der rough paths over E, 
where E is a Banach space chosen according to context. 2 On this space, we denote by p a _Hoi the 
corresponding inhomgenous distances. 

Let U be some separable metric space (the control space) . Denote by M. the class of measurable 
controls fi : [0, T] — >• U. 

When working on a filtered probability space (fi, J 7 , P), A will denote the class of progressively 
measurable controls v : 57 x [0, T] —> U . 

3. Deterministic control with rough paths 

Let r\ : [0,T] — > R d be a smooth path. Write X — X 1 ^^ for the solution to the controlled 
ordinary differential equation 

(3.1) ,/.y:--' >,(.Y; '-.,/jr/s . .(.y: •-)'/'/,. x?* = x 

Classical control theory allows to maximize f (s, A* ,:r,u , /i s ) ds + g (X^,) over a class of admissible 
controls fi. As is well-known, 

(3.2) v (t, x) := sup | ( T j (s, X*™, ft.) ds + g (X^) 

M Jt 



2 For a > i these are just Holder continuous S-valued paths. For a G (1/3, 1/2] additional "area" information is 
necessary. We refer to [24] and [14] for background on rough path theory. 
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is the (under some technical conditions: unique, bounded uniformly continuous) viscosity solution 
to the HJB equation 

-dtv - H (t, x, Dv) - (a (x) , Dv) f] = 0, 

v(T,x) = g(x). 

where H acting on v is given by 

(3.3) H(t,x,p) = sup {(b (x,u) ,p) + f (t,x,u)} 

u 

Now, (3.1) also makes sense for a driving rough path (including a controlled drift terms to the 
standard setting of RDEs is fairly straight-forward; for the reader's convenience proofs are given in 
the appendix). This allows to consider the optimization problem (3.2) for controlled RDEs. 

3.1. HJB equation. The main result here is that the corresponding value function satisfies a 
"rough PDE". Such equations go back to Lions-Souganidis (in [21] they consider a pathwise stochas- 
tic control problem and give an associated stochastic HJB equation, see also [4]; these correspond 
to T) = B(cj) in the present section). However their (non- rough) pathwise setup is restricted to 
commuting diffusion vector fields ai,. . . ,ad (actually, [21] consider constant vector fields). Exten- 
sions to more general vector fields via a rough pathwise approach was then obtained in [6] (see also 
[10]). 

Definition 1. Let r) G C°' Q be a geometric rough path, a G (0, 1]. Assume F,G,(p to be such that 
for every smooth path rj there exists a unique BUC viscosity solution to 

-dtv 71 - F(t, x, u",W,DV) - G(t, x, v 71 , = 0, 

w"(T,x) = (f>(x). 

We say that v G BUC solves the rough partial differential equation 

-dv - F(t, x, v, Dv, D 2 v)dt - G(t, x, v, Dv)dq t = 0, 

v(T,x) = (j>(x), 

if for every sequence of smooth paths 77" such that 77™ — > r\ in rough path metric we have locally 
uniformly 

u r '" -> v. 

Remark 2. (1) We remark that uniqueness of a solution, if it exists, is built into the definition 
(by demanding uniqueness for the approximating problems). 
(2) In special cases (in particular the gradient noise case of the following theorem) it is possible 
to define the solution to a rough PDE through a coordinate transformation, if the vector 
fields in front of the rough path are smooth enough. This approach is followed in [20] . 

The two definitions are equivalent, if the coefficients admit enough regularity (see [6] ). 
In the following theorem the coordinate transformation is not applicable, since a is only 
assumed to be Lip 7 instead of Lip 7+2 . 

Theorem 3. Let 77 e C°- Q (IR d ) be a rough path, a £ (±, ±]. Let 7 > 1/a. Let b : W x U -> M e 
be continuous and let b(-,u) G Lip 1 (R e ) uniformly in u G U . Let o\, . . . ,0 d & Lip 7 (IR e ). Let 
g G _BC/C(R e ). Let f : [0, T] x R e x U — > R be bounded, continuous and locally uniformly continuous 
in t, x, uniformly in u. 
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For /j, £ M. consider the RDE with controlled drift 3 (Theorem 26), 
(3.4) ,/y < »" = b (Jf^'^/i) dt + a (Jf*.*./*.*) dr], x\™" = x. 

Then 



v (t, x) := v n (t, x) := sup 



is the unique bounded, uniformly continuous viscosity solution to the rough HJ equation 

-dv - H (x, Dv) dt - (cr (x) , Dv) drj = 0, 

v(T,x) = g(x). 

Proof. The case / = 0, a £ Lip 7+2 appears in [9]. The general case presented here is different, since 
we cannot use a coordinate transformation. Let a smooth sequence rf 1 be given, such that 77" — >- 77 
in C Q a . Let 

v n (t,x) := sup E t , x [r) n ,fi\, 

where St^'y, fi\ := f (s, X 1 ^^'' 1 , fi s ) ds+g (X^ 31 '' 1 ' 7 ) for any (rough) path 7. By classical control 
theory (e.g. Corollary III. 3. 6 in [1]) we have that v n is the unique bounded, continuous viscosity 
solution to 

-dv n - H (x, Dv n ) dt - (a (x) , Dv n ) drf 1 = 0, 

v n (T,x)=g(x). 

Then 

\v n {t,x) - v(t,x)\ < sup \Z ttX [r),ij]-Et, x [r] n ,tj]\' 

Note that S is continuous in 7 uniformly in /1 (and (t, x)) by Theorem 26. Therefore, v" converges 
locally uniformly to v and then, by Definition 1, v solves (3.5). □ 

Example 4. In the case with additive noise (o-(x) = Id) and state-independent gains / drift 
(f(s,x,u) = f(s,u),b(x,u) = b(s,u)), this rough deterministic control problem admits a simple 
solution. Indeed, if V is the value function to the standard deterministic problem for rj = 0, i.e. 

v°(t,x) = sup f(s,fi s )ds+g \x + b(s, /ijdsj j , 

then one has immediately (since r\ only appears in the terminal gain) 

v r >(t,x) = v°(t,x + r ]T -ri t ). 

When v° has a nice form, this gives simple explicit solutions. For instance, assuming in addition 
f = 0, U convex and b(s, u) — u, v a is reduced to a static optimization problem and 

ifit, x) = sup g(x + r\ T — r\ t + (T — t)uj . 

uEU 



Extension to time-dependent b, a would be straightforward, see [14, ch. 12]. It is also possible to consider the 
controlled hybrid RDE/SDE dX = b(X, fj,)dt + a(X, a)dW + a{X)dr), see [9]. 
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3.2. Pontryagin maximum principle. If r\ is smooth, then Theorem 3.2.1 in [31] gives the 
following optimality criterium. 

Theorem 5. Let n be a smooth path. Assume b, f,g be C 1 in x, such that the derivative is Lipschitz 
in x, u and bounded and let a, g be C 1 with bounded, Lipschitz first derivative. Let X,Ji be an optimal 
pair for problem (3.2) with t — 0. Let p be the unique solution to the backward ODE 

-p(t) = Db(X u fit)p(t) + Da(X t )r] t p(t) + Df(X t ,fi t ), 
P (T) = Dg(X T ). 

Then 

b(X t ,fit)p(t) + f(X t ,Ji t ) = sup [b(X t , u)p{t) + f(X t , «)] , a.e. t e [0, T]. 

Let now rj be rough. We have the following equivalent statement. 

Theorem 6. Assume the same regularity on b,f,g as in Theorem 5. Assume <n, ...,<7<j G 
Lip 7+2 (R e ). Let T] G C°' a be a geometric rough path, a G (1/3,1/2]. Let X,p, be an optimal 
pair. Let p be the unique solution to the controlled, backward RDE 

-dp(t) = Db(X t ,fi t )p(t)dt + Da(X t )p(t)dv t + Df(X t ,]l t )dt, 

p(T) = Dg(X T ). 

Then 

b{X u fl t )p(t) + f(X t , Jx t ) = sup [b(X t ,u)p{t) + f(X t ,u)] , a.e. t. 

Remark 7. This is the necessary condition for an admissible pair to be optimal. In the classical 
setting there do also exist sufficient conditions (see for example Theorem 3.2.5 in [31],). They 
rely on convexity of the Hamiltonian and therefore will in general not work in our setting because, 
informally, the dr\-term switches sign all the time. 

We prepare the proof with the following Lemma. 

Lemma 8. Let X,J1 be an optimal pair. Let \i be any other control. Let Id [0,T] be an interval 
with |/| = e. Define 

fi £ (t) := lj(t)/i(i) + l [0 ,r|\/(t)/i(t). 

Let X e be the solution to the controlled RDE (3.4) corresponding to the control [i e . Let Y 6 be the 
solution to the RDE 

Yf= f Db{X r ,Jl r )Y r E dr+ f Da(X r )Y r E dri r + f [b(X r ,p r ) - b(X r fi r )] l z (r)dr. 

Jo Jo 



0(e), 
o(e), 

(Dg(X T ),YS) 

+ f [(Df(X r ,il r ),Y r ") + {f(X r ^ r )~f(X r ,fi r )}l I (r)]dr + o(e). 



Then 

(3.6) sup\Xt-X t \ = 
t 

(3.7) 

su P |x t E -x t -r/| = 

t 

J(^) ~ J(p) = 

(3.8) 
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Proof. The joint RDE reads as 



X t = x 
X £ = x 



b (X r , n r ) dr + a (X r ) drj r 



f b(X £ rl ^ £ r )dr + a(X £ )dri r 
Jo 

Y t e = f Db(X r ,il r )Y r £ dr+ f Da(X r )Y £ drj r + f [b(X r , fx r ) - b(X r fl r )] h(r)dr. 
Jo Jo Jo 

By Theorem 26 we can write the solution S £ = (X, X s , Y £ ) as <j)(t, S £ ) where 

d(j){t,s) = a(4>(t, s))dr) t 

and S = (X,X £ ,Y e ) with 
where 

cr(xi) 

a(xx,x 2 ,x 3 ) = I a(x 2 ) 



Da(xi)x 3 , 



and B(t, (x±, x 2 , x 3 ), (iti, u 2 )) is given by 



9x2^2 X))b((j) 2 (t, X), U 2 ) 

d x ^ 3 \t, <f>{t, x)) [Db(Mt, x), Ul )<j> 3 (t, x) + {6(0! (t, x), u) - 6(0! (t, x), u)} l,(t)] 

+e?x, (t, (i, x),ui) 



\ 



From Lemma 3.2.2 in [31] it follows that 

sup\X £ -X t \ = 0(s). 
t 

From this we can deduce (3.6) using the Lispchitzness of <f>. Furthermore, (3.7) follows from 

snp\X £ -X t -Y £ \=o(e) 
t 

and 

x (i, (a, 6, b~ a)) - 2 (i, (a, 6,6 -a)) - 3 (t, (a, 6, 6- a)) = 0(|6 - a| 2 ). 

(note that d Xl 4> 3 1 (t, <j>(t, {x\,x 2 , 0))) = 0). Finally (3.8) follows by direct calculation from (3.6) and 
(3.7). □ 

Proof. We follow the idea of the proof of Theorem 3.2.1 in [31]. Fix xq E R e . Without loss of 
generality we take t = 0. Define for /j, E M 

J{p) ■■= I* f(r,X° r '*°>™,» r )dr + g(X% x °>™), 
Jo 

(so that v(0,xq) = inf pe x J(fJ,)). 
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Since rj is geometric, we have 



{Dg(X T ),YS) = (pr t Yf) - ( Po ,Y s ) 

i-T 



(Df(X r ,j2 r ),Y r s )dr + / (p(r), [b{X r ,n r ) - b(X r ,fi r )] lj{r))dr. 
Jo 

Here, Y £ and / are given as in Lemma 8. 

Let any u G U be given. Let fi(t) = u. Let t € [0, T) and let e > small enough such that 
J e := [t, t + e] C [0, T]. Then, combined with Lemma 8 we get 

> J(aO - J{P) 

= (Dg(X T ),YS) + f [(Df(x r ,fi r ),Y r s ) + {f(X r ,^ r ) - f(X r ,fi r )} lj(r)] dr + o(e) 



(Df(X r ,fi r ),Y;)dr+ / (p(r), [b(X r , fx r ) - b(X r ,Jl r )] h(r))dr 



+ / [{Df{x r ^r)^) + {f{X r ,pL r ) - f{X r ,Jx r )}^i{r)}dr + o{e) 



t+e 

(p(r), [6(X r! u) - &(X r , Mr)] ) + /(^r, u) - f(X r , K)dr + o(e). 

Dividing by e and sending e — > yields, together with the separability of the metric space, the 
desired result. □ 

3.3. Pathwise stochastic control. We can apply Theorem 3 to enhanced Brownian motion, i.e. 
take rj = B (lu), Brownian motion enhanced with Levy's stochastic area which constitutes for a.e. 
u a geometric rough path. The (rough) pathwise unique solution to the RDE with controlled drift, 
^'^It^bO) then becomes a solution to the classical stochastic differential equation (in Stratonovich 
sense) (Theorem 26). 

Proposition 9. Under the assumptions of Theorem 3, the map 



0J h-> SUp 



f{s,XM,n s )ds + g{X™)S 

) 77=B(w) 

is measurable. In particular, the expected value of the pathwise optimization problem, 



(3.9) v(t,x)=E 
is well-defined. 







sup < 









f{s,X^^ s )ds + g{X^) 



»j=BM 



Proof. The lift into rough pathspace, ui t— > B(w), is measurable. v n as element in BUC space 
depends continuously (and hence: measurably) on the rough path rj. Conclude by composition. □ 

Remark 10. Well-definedness of such expressions was a non-trivial technical obstacle in previous 
works on pathwise stochastic control; e.g. [8, 4]. The use of rough path theory allows to bypass this 
difficulty entirely. 
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Remark 11. Let us explain why we only consider the case where the coefficient o~(x) in front of 
the rough path is not controlled. It would not be too difficult to make sense of RDEs 



assuming good regularity for a and (u s ) s >q chosen in a suitable class (for instance : u piecewise 
constant, u controlled by rj in the Gubinelli sense,...). However, in most cases of interest the control 
problem would degenerate, in the sense that we would have 



The reason is that if a has enough u-dependence (for instance if d = 1, U is the unit ball in R e and 
o~(x,u) — u) and rj has unbounded variation on any interval (as is the case for typical Brownian 
paths), the system can essentially be driven to reach any point instantly. 

In order to obtain nontrivial values for the problem, one would need the admissible control pro- 
cesses to be uniformly bounded in some particular sense (see e.g. [25] in the Young case, where the 
(fi s ) need to be bounded in some Holder space), which is not very natural (for instance, Dynamic 
Programming and HJB-type pointwise optimizations are then no longer valid). 

4. Duality results for classical stochastic control 

We now link the expected value of the pathwise optimization problem, as given in (3.9), to the 
value function of the (classical) stochastic control problem as exposed in [18, 12], 



Here, the sup is taken over all admissible (in particular: adapted) controls v on [t, T}. There are 
well-known assumptions under which V is a classical (see [18]) resp. viscosity (see [12]) solution to 
the HJB equation, i.e. the non-linear terminal value problem 



dX = b(t, X, u)dt + a(X, u)dr) 




(4.1) 




-d t V ~ F (t, x, DV, D 2 V) = 
V(T,.) = g; 

uniqueness holds in suitable classes. In fact, assume the dynamics 4 



d 




i=l 



where b(x, u) = b(x, u) + ^ SiLil '* ' Doi)(x) is the corrected drift. Then the equation is semilinear 
of the form 



-d t V - H (i, x, DV) — LV = 0, 
V(T,-)=g. 



Again, it would be straightforward to include explicit time dependence in the vector fields ft, o-;. 
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where 



and H is given by H (t, x,p) = sup. 
Let us also write 



'u 




L U V = b{-,u)DV + LV, 



ueU. 



Denote by A the class of progressively measurable controls is : x [0, T] — > U. As before M is 
the class of measurable function /i : [0, T] —> U, with the topology of convergence in measure (with 
respect to dt). 

Theorem 12. Let Zjr be the class of all mappings z : C 0a xM^R such that 

• z is bounded, measurable and continuous in r\ G C°' a uniformly over fi G M. 

• E[z(B,i/)] > 0, if v is adapted 

Let b : M. e xU — > R e be continuous and let &(•, it) G Lip 1 (R e ) uniformly in u G U . Let 0"i, . . . ,aa G 
Lip 7 (M e ), /or some 7 > 2, 5 G BC/C(R e ) and / : [0, T] x M e x ?7 -» K bounded, continuous and 
locally uniformly continuous in t,x, uniformly in u. Then we have 



Where B denotes the Stratonovich lift of Brownian motion to a geometric rough path and X *> x >' i > T ' 
is £/ie solution to the RDE with controlled drift (Theorem 26) 



Remark 13. Every choice of admissible control v G A in (4-1) leads to a lower bound on the value 
function (with equality for v = is* , the optimal control). In the same spirit, every choice z leads 
to an upper bound. There is great interest in such duality results, as they help to judge how much 
room is left for policy improvement. The result is still too general for this purpose and therefore it 
is an important question, discussed below, to understand whether duality still holds when restricting 
to some concrete (parametrized) subsets of Zjr. 

Proof. We first note, that the supremum inside the expectation is continuous (and hence measur- 
able) in 77, which follows by the same argument as in the proof of Theorem 3. Since it is also 
bounded, the expectation is well-defined. 

Recall that X t,x,v is the solution to the (classical) controlled SDE and that X t,x '^ is the solution 
to the controlled RDE. Let z G Zjr. Then, using Theorem 26 to justify the step from second to 



V{t, x) = inf E sup 




</.V" ' " = b (X*"'"", At) dt + a (X t ' x '»' r >) drj, X\ 



= X. 
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third line, 



V(t,x) = supE 

veA 



< supE 
v eA 



sup E 
v&A 



< E 



f( S ,Xl^,u s )d S + g(X^ v ) + z(B,u) 



f(s, .V; ' » s )ds + g{x¥>»») + z( V , M ) 



fi—u^rf—Ji 



sup 

VGM I Jt 



f(s, XZ**™, pi s )ds + g(X^) + z( V , u) 



?7=B 



And to show equality, let 



(77, fi) := V(t, x) / f(s, .V: ^' ". njda + g(X^). 



Then z* G and equality is attained. 



□ 



4.1. Example I, inspired by the discrete-time results of Rogers [27]. We now show that 
Theorem 12 still holds with penalty terms based on martingale increments. 

Theorem 14. Under the (regularity) assumptions of Theorem 12 we have 



V (t, x) = inf E 

heel- 2 



sup 



jjT/( S ,X<™ Ms ) ds + g(X^) 



Mt.x.ii.rj.h 
IT 



tj=B(a(). 



where 



27ia£ is, Theorem 12 still holds with Zjr replaced by the set {z : z(r),n) = M*'^'^'* 1 ' , h G C^' 2 }. 



Moreover, if V G C^' 2 the infimum is achieved at h* = V. 
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Proof. We have 
V(t,x) 

< inf 

heel' 

had- 2 \ 



sup < 


1/ 


MEM 





?7=B 



sup 

liGM I Jt 



< inf x) + sup 

heCl' 2 \ Jt zGR e ,ti£ 



r7=B 



[/(s, x, u) + (<9 S + L u )h(s, x)} ds + sup [5(2) - /i(T, a;)] 



< inf h(t, x) + 
hest \ 

< inf h(t,x). 
hes+ 



sup [f(s, x, u) + (d s + L u )h(s, x)] ds + sup [g(x) - h(T, x)] 
t xeR c ,ueu igi c 



where the first inequality follows from (the proof of) Theorem 12 and denotes the class of 
smooth super solutions of the HJB equation. 

But in fact the infimum of smooth supersolutions is equal to the viscosity solution V, all in- 
equalities are actually equalities and the result follows. This can be proved via a technique due to 
Krylov [19] which he called "shaking the coefficients". For the reader's convenience let us recall the 
argument. 

Extending by continuity 6, a and / to t 6 (— oo, oo), define for e > 0, 



sup 

ueU,\s\ , |e| <e 



F%t,x,p,X) 
and consider V s the unique viscosity solution to 



(b(x + e, u),p) + ^Tr(aa T (x + e)X) +f(t + s,x + e, u) 



dt 



F E (t,x,DV e ,D 2 V E ) = 0, 
V e (T,-) = g. 



By (local) uniform continuity of b, er, f one can actually show that V — > V e as e — > 0, locally 
uniformly. This can be done for instance by interpreting V 6 as the value function of a stochastic 
control problem. 

Now take some smoothing kernel p e with L e +i P e = 1 an d su PP(/? e ) C [— £,e] e+1 , and define 
V £ := V e * p e . Clearly by definition of F £ , for each \s\, |e| < e, V e (- — s, ■ — e) is a supersolution to 
the HJB equation -d t V - F(t, x, DV, D 2 V) = 0. Since F is convex in (DV, D 2 V) it follows that 



V e (■ — s, ■ — e)p e (s, e)dsde 



is again a (smooth) supersolution (for the details see the appendix in [2]). Finally it only remains 
to notice that \V - V £ \ < \V - V * p £ \ + \(V - V e ) * p E \ (locally uniformly). □ 
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Remark 15. Note that 
V h (t,x) := E 



sup { f T f{s,X^^^ s ) ,1s ■ g(X%*'™) 



M, 



t,T 



) 77=B(w) 

for fixed x, t, is precisely of the form 3.9 with f resp. g replaced by f resp. g, given by 

= f(s,;») + (d s + L»)h(s,-), 
g(-) = g(-) + h(T,.)-h(t,x). 
The point is that the inner pathwise optimization falls directly into the framework of Section 3. 
Remark 16. For rj a geometric rough path, we may apply the chain rule to h(s,X s ) and obtain 

l-T 



h(T,X T ) - h(t,x) 



(Dh(s, X s ), b(s, X s ,n s )ds + a{X s )d Vs ). 



It follows that the penalization may also be rewritten in a (rough) integral form 



M 



t.T 



{Dh{s, X s ),a(X s )drj s )+ f { ((6 - b)(s, X s ,» s ),Dh(s, X s )) + Tr[(aa T )D 2 h](s, X s )} da. 



Note that for t] = B and adapted v , this is just the ltd integral {Dh(s, X s ), a(X s )dB s ) . 

Remark 17. If one were to try anticipating stochastic calculus, in the spirit [8], to implement 
Roger's duality in continuous time, then - leaving aside all other technical (measurability) issues 
that have to be dealt with - more regularity on the coefficient will be required. This is in stark 
contrast to the usual understanding in SDE theory that rough paths require more regularity than ltd 
theory. 

Example 18. From example ^ we can see that in some special cases this method gives explicit 
upper bounds. Assume : 

• additive noise (a = Id), 

• state-independent drift b = b(u), 

• running gain f(s,x,u) = f (u) + Vh(x) ■ b(u), with h subharmonic (Ah >0). 

Then for the penalty corresponding to h(t,x) = h(x), the inner optimization problem is given by 



sup 
IJ.GM I Jt 



< sup 



(f ( A + (Vh(Xl>*<^), b(fi s )))ds + g (X 



T ) 



{(vh(xr-^)M^) + ±Ah(xr>^))d S 



f (p s ) ds + (g h) (X^) = V°' h (t, x + Vt - n t ), 



where V > is the value function to the standard control problem 
V°(t, x) = sup J f f(fi s )ds + (g-h)(x + 

AtGM [Jt \ 

From Theorem 14, we then have the upper bound 

V[t, x) < h(x) + E [V°- h [t, x + B T - B t )] . 
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Remark 19. As in Remark 11, one can wonder how Theorem 14 could translate in the case where 
a depends on u. As mentioned in that remark, under reasonable conditions on a the control problem 
degenerates so that for any choice of h, say for piecewise- constant controls pi, we can expect that 



sup ■ 



/(*, .V: 1 *' ".//.: + (d t + L»°)h(s,Xl< x ^)ds + giX^'W) - h(T,X l T x ^) 



T7=B 



sup [f(s, x, u) + (dt + L u )h(s, x)] ds + sup [g(x) — h(T, x)] . 

t x£Wl',u£U i6l e 

In other words there is nothing to be gained from considering the (penalized) pathwise optimization 
problem, as we always get 



V(t,x) < h(t,x) + 



sup [f(s, x, u) + (d t + L u )h(s, x)] ds + sup [g(x) - h(T, x)] 

t x£WL<>,u£U xgl e 



which is in fact clear from a direct application of ltd 's formula (or viscosity comparison). 

4.2. Example II, inspired by Davis Burstein [8]. Under certain concavity assumptions, it 
turns out that linear penalization is enough. 

Theorem 20. Let g be as in Theorem 12 and assume f = 0; furthermore make the (stronger) 
assumption that b £ C^, a £ C|, aa T > 0, and that (4.1) has a feedback solution u* which is 
continuous, C 1 in t and in x, taking values in the interior of U . Assume that U is a compact 
convex subset ofW 1 , 

Let Z t:x ' n be the solution starting from x at time t to : 



(4.4) 



dZ = b(Z,u*{t,Z))dt + o-(Z)dri-b u (Z,u*(t,Z))u*(t,Z)dt, 



let W(t,x) := W(t,x\rf) := g(Ztf, x ' 11 ) and assume that 



(4.5) 
Then 



\/(t,x), u i— > (b(x,u), DW(t, x;B)) is strictly concave, a.s. 



V(t,x) = inf E[ sup I / 



f(r, X}*™, fi r )dr + g(X^) + / <A(r, X*™", r]),(* r )dr 



Where A is the class of all A : [0,T] x K e x C°' a -)■ R d such that 

• A is bounded and uniformly continuous on bounded sets 

• A is future adapted, i.e. for any fixed t,x, X(t, x, B) £ Tt,T 

• E[A(i,a;,B)] = for all t,x. 

That is, Theorem 12 still holds with Zjr replaced by the set 

{z : z( V , fx) = { T (\( Sl X*™*, v), Vs)ds, A £ A}. 



Moreover the infimum is achieved with \*(t,x,rf) := b^(t,u*(t,x))DW(t,x;rj). 

Remark 21. The concavity assumption is difficult to verify for concrete examples. It holds for the 
linear quadratic case, which we treat in Section 4-3. 
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Remark 22. The case of running cost f is, as usual, easily covered with this formulation. Indeed, 
let the optimal control problem be given as 

dX = b(X, v)dt + a(X) o dW, 
V(t,x) = supE[[ f{X,u)dr + g{X T )). 

Define the new component 

dXf +1 = f(X, u)dt, X? +1 = x. 
Then the theorem yields that the penalty 

\*(t, x) := (b u , f u ) ■ (D X1 d g(Z T ) + D Xl d Z c T +1 ,D Xe+l Z e T +1 ) 
= (buju) ■ (D X1 d g(Z T ) + D X1 d Z e T + \ 1). 

is optimal, where 

dZ = b(Z, u*)dt + a(Z)dri - b u (Z, u*)u*dt, 
dZ e+1 = f(Z, u*)dt - f u (Z, u*)u*dt. 

Proof. From (the proof of) Theorem 12 we know V(t, x) < vaix^A^\- ■ ■]■ The converse direction is 
proven in [8] by using 5 

X*(t,x,r]) = b^(t,u*(t,x))DW(t,x). 
For the reader's convenience we provide a sketch of the argument below. 

□ 

Sketch of the Davis-Bur stein argument. We have assumed that the optimal control for the stochas- 
tic problem (4.1) is given in feedback form by u*(t,x). Write X l ' x >* := X t > x ' u . 
Recall that Z 1 ^^ is the solution starting from x at time t to : 

dZ = b(Z,u*(t,Z))dt + o-{Z)dr]-b u (Z,u*(t,Z))u*(t,Z)dt. 

Assume that W(t,x) := W(t,x;rf) = g(Z^ x ' v ) is a (viscosity) solution to the rough PDE 

(4.6) -d t W - (b(x,u*(t,x)) -b u (x,u*(t,x))u*(t,x),DW) - (a(x),DW)fi = 0, 

and assume that W is differentiable in x. 
We assumed that 

V(t, x), u M> (b(x, u), DW(t, x)) is strictly concave. 

It then follows that 

(4.7) {b(x,u*(t,x)) - b u (x,u*(t,x))u*{t,x),DW) = sup (b(x,u) - b u (x,u*(t,x))u,DW) . 

Because of (4.7) the PDE (4.6) may be rewritten as 

-d t W - {b{x, u*{t,x)),DW) - (u*(t,x),X*{t,x;rj)) - (a (x) , DW) rj 
= -d t W- sup {(b(x,u),DW) - (u,X*(t,x;r])}} - (a(x),DW}r) 

= 0. 



5 The paper of Davis— Burstein predates rough path theory and their proof relies on anticipating stochastic calculus. 
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By verification it follows that actually W is also the value function of the problem with penalty A*, 
and the optimal control is given by u*, i.e. 

i-T 



W(t,x) = W(t,x;rj) = sup 



Then, by Theorem 26 we have (if the convexity assumption (4.5) is satisfied a.s. by r\ = B(w)) 
W(t,x;B) = g(X% x <*) -J (X*{s,X^ x '*;B),u*(s,X t s ' x '*)) ds. 

ft follows in particular that for the original stochastic control problem 

V(t,x) = supE^X^)] = E[g(X%*>')] 



vEA 



3 



(X*! 8 '*) - f (X*(s, Xl ,x '*; B), u*(s, X***)) ds 
sup L;.V^ ii "i f T (\*(s,X t s ' x '^' r, ]r]), ^i s ) ds\ \ v= i 

iGM Jt 



(J.GM 

Here we have used that X*(t, x, B) is future adapted and E[A*(t, x, B)] = Vi, x. which is shown on 
p. 227 in [8]. 

□ 

Remark 23. The two different penalizations presented above are based on verification arguments 
for respectively the stochastic HJB equation and the (rough) deterministic HJB equation. One can 
then also try to devise an approach based on Pontryagin's maximum principles (both stochastic and 
deterministic). While this is technically possible, the need to use sufficient conditions in the rough 
PMP means that it can only apply to the very specific case where a is affine in x, and in consequence 
we have chosen not to pursue this here. 

4.3. Explicit computations in LQC problems. We will compare the two optimal penalizations 
in the case of a linear quadratic control problem (both for additive and multiplicative noise). 

4.3. f. LQC with additive noise. The dynamics are given by 6 

(4.8) dX = (MX + Nv)dt + dB t 
and the optimization problem is given by 

(4.9) V(t,x) =sup„ eA E[±tf((QX s ,X s ) + (Rv Sl v s ))ds + ±(GX T ,X T ) 
This problem admits the explicit solution (see e.g. Section 6.3 in [31]) 



(4.10) 



V(t,x) 



±{P(t)x,x) + ± 



Tr(P(s))ds, 



''This equation admits an obvious pathwise SDE solution (via the ODE satisfied by X — B) so that, strictly 
speaking, there is no need for rough paths here. 
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where P is the solution to the matrix Riccati equation 
P{T) = G, 

P'(t) = -P(t)M - t MP(t) + PNR- lt NP(t) -Q, 

and the optimal control is then given in feedback form by 

v*{t,x) = -R- u NP(t)x. 

Proposition 24. For this LQ control problem the optimal penalty corresponding to Theorem 20 is 
given by 

where 

A x (i; 77) = -*N J 1 ' e M ^- t) P(s)dr )s . 
The optimal penalty corresponding to Theorem H is given by 

where 

J R (V)= / T (P( S )X s °,d,7 s )-i [ T Tr(P(s))ds 7 



X° denoting the solution to the RDE dX = MX + dr\ starting at (t,x). 5 In particular, these 
two penalizations are equal modulo a random constant (not depending on the control) with zero 
expectation. 

Proof. The formula for z 1 is in fact already computed in [8, sec. 2.4], so that it only remains to do 
the computation for the Rogers penalization. 
It follows from Remark 16 that 



£(DV(s t X B ),dri B ) ~\ f Tr(D 2 V(s,X s ))ds 

J (P(s)Xg,dri s ) -\ ^ Tr(P(s))ds 

(P(s)(X° + J' e M ^Nf, r dr),d Vs ) Tr(P(s))ds 

far,?* J e M ^P(s)d Vs ))dr + j t (P(s)X°,dr, s )-^J t Tr(P(s))ds. 

Hence we see that this penalization can be written as z 2 = z 1 + j R (r]), where J R {t]) does not 
depend on the chosen control. One can check immediately that E[7 fl (?7)| 77=B ( u )] = 0. 

□ 
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4.3.2. LQC with multiplicative noise. Let the dynamics be given by 

n 

(4.11) dX = (M X + Nv)dt + CiX o dB\ 

i=l 
n 

(4.12) = (MX + Np)dt + ^2CiXdB l t . 

i=l 

Denote by X t ' x ' ,1 ' TI the solution starting from x at time t to 

n 

dx t, x ,^v = tMxi ' r 'i + Nfx)dt + c,.v; • -"./j,: 

i=l 

and by r t;S the (matrix) solution to the RDE 

n 

d s r t>s = Mr t , s ds + Y c t r t , s dr] s , r t , t = / 

i=l 

Then 

.v: r ; ' " = I r> .r + J r r>s Nfi r dr. 

For simplicity we now take d = n = 1: the general case is only notationally more involved. 
The optimization problem is given by 

(4.13) V(t,x) = sup„ e ^E \$t(QX 2 s +Rv 2 s )ds + \GX%, . 
By Section 6.6 in [31] the value function is again given as 

V{t,x) = l -P t x 2 

and the optimal control as 

u*(t,x) = -R^NPtx, 

where 

(4.14) P t + 2P t M + 2P t C 2 + Q - ^R^P 2 = 0, P T = G. 
We can then compute explicitely the Davis-Burstein and Rogers penalties : 
Proposition 25. For t < r <T, define 

Q r = J Ps^lAdris-Cds). 

ponding to Theorem 20 is giv 

z 1 ^,^) = —CNxJ^ Q s [i s ds, 

while the optimal penalty corresponding to Theorem H is given by 

z 2 {r],^) = CO t x 2 + CNx [ t t , s Qs^ s ds + CN 2 ( [ T r ^ rVs Q rys n r n s drds. 

Jt Jt Jt 



Then the optimal penalty corresponding to Theorem 20 is given by 
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Proof. The optimal penalty stemming from Theorem 20 (see also Remark 22) is given by A* (r, x)fi r dr, 
where 

A*(r, a:) = iV (GZ^d x Z?t + d x Z^) - NP{r)x, 

where 

dZ 1 = MZ 1 ds + CZ 1 dr\ u Z x r = x; 
dZ 2 = - (Q - NiR-^is) 2 ) {Z v ) 2 ds, Z 2 = 0. 
Since Z\ = T rs x, this is computed to 

X*(r,x) = Nx ^GT 2 r T + (Q - N 2 R- l P{s) 2 ) T 2 %s ds - P(r)^J . 

= Nx ([P(s)T 2 rjS ] T s=r + £ (-P(s) - 2MP(s) - 2C 2 P(s)) T 2 rs ds^j 



Nx 



£ F P(s)Tl s (Cd Vs -C 2 ds)j 



= NCxQ r . 

For the optimal penalty corresponding to Theorem 14, we apply again Remark 16 to see that 
the optimal penalty is given by 

r-T 



M tm,V = J ( DV ( s ,X s ),CXdr] s ) - J Tr[C 2 X 2 D 2 V(s,X s )}ds 

= J P r C\X t r ' x '»< t1 \ 2 d<n r -J C 2 \X l r x -^P r dr. 

It only remains to perform straightforward computations expanding the quadratic terms and 
applying Fubini's theorem. □ 

5. Appendix: RDEs with controlled drift 

Theorem 26 (RDE with controlled drift). Let a G (1/3, 1/2]. Let r) G C°' Q a geometric a-Holder 
rough path. Let 7 > —. Let U be the subset of a separable Banach space. Let b : R e x U — > R e 
sucht that b{-,u) G Lip 1 (R e ) uniformly in u G U (i.e. sup ueU \\b(-, u)||Lip 1 (R=) < 00/ an d such that 
u M> b{-, u) is measurable. Let eri, . . . ,o~d G Lip 7 (R e ). Let fx : [0,T] — »• U be measurable, i.e. \x G A4. 
(i) Then there exists a unique Y G C°' Q that solves 

Y t = y Q + f b(Y r ,n r )dr+ [ a{Y)dr) r . 
Jo Jo 

Moreover the mapping 

(x ,v) GC°< Q 
is locally Lipschitz continuous, uniformly in fj, G M.. 
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(ii) Assume moreoever that U 3 u i— > b(-,u) G Lip 1 (R e ) is Lipschitz. If we use the topology of 
convergence in measure on M then 

M x R e x C°' a C°' Q 

(5-1) 

is continuous. 

(Hi) If v : £1 x [0,T] — > U is progressively measurable and B is the Stratonovich rough path lift 
of a Brownian motion B, then 

(5-2) Y\^, v =n=y, P -as., 

where Y is the (classical) solution to the controlled SDE 

Y t = y + f b(Y r ,v r )dr+ [ a(Y)odB r . 
Jo Jo 

(iv) If moreover o~i, . . . ,o~d £ Lip 7+2 (R e ), we can write Y = <f>(t, Y t ) where <j> is the solution flow 
to the RDE 

cf>(t,x)=x+ / a((j}(r,x))dr] r , 
Jo 

and Y solves the classical ODE 

Y t =x + / b(r, Y r , n r )dr, 
Jo 

where we define componentwise 

b(t,x,u)i = ^d Xk 4>~ 1 (t,(f>(t,x))b k ((t)(t,x),u). 

k 

Remark 27. In the last case, i.e. point (iv), we can immediately use results in [14] (Theorem 
10.53) to also handle linear vector fields. 



Proof. Denote for /i e M 



Z?(-):= b(-,(i r )dr, 



which is a well defined Bochner integral in the space Lip 1 (M e ) (indeed, by assumption on b, 
So \\K;Vr)\\u P ^m°)dr < oo). Then e (ji- Holder ( [o, T] , Lip 1 (M e ) ) . Indeed 



7H 



\\Zt -^||Lipi(R e ) = II / 6 (-,Mr)^llLipi(R e ) 



r-l 

< 



< \t-s\ sup ||6(-,u)|| Lip i (R e), 



SO 



(5-3) WZ^Wi- Holder < SUP 1 , It) | | Lip l (Re) < OO, 

ueu 

independent of [i € M. 
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By Theorem 29 we get a unique solution to the RDE 

dY = f(Y)dZ fl + a(Y)dq 

where / : M e — > L(Lip 1 (R e ), ]R e ) is the evaluation operator, i.e. f(y)V := V(y). This gives 
existence of the controlled RDE as well as continuity in the starting point and in r\. By (5.3), this 
is independent of fj, E M. and we hence have shown (i). 

Concerning (ii), assume now that U 3m4!i(-,ii)£ Lip is Lipschitz. Using the representation 
given in the proof of (i) it is sufficient to realize that if \i n — > \i 6 M. in measure, then Z 11 — > Z^ 
in C 3 - H6WeT '([0,T],Lip 1 (M e )), for all (3 <1. 

Concerning (iii): first of all, we can regard v as a measurable mapping from (SI, into the 
space of all measurable mappings from [0, T] — >• U with the topology of convergence in measure. 
Indeed, if U is a compact subset of a separable Banach space, then this follows from the equivalence 
of weak and strong measurability for Banach space valued mappings (Pettis Theorem, see Section 
V.4 in [32]). If U is a general subset of a separable Banach space, then define v n : O — > J\4 with 
v n (uS) t := § n (y(uS)t). Here $" is a (measurable) nearest-neighbor projection on {xi, . . . ,x n }, the 
sequence (xk)k>o being dense in the Banach space. Then v n is taking values in a compact set and 
hence by the previous case, is measurable as a mapping to M., Finally v is the pointwise limit of 
the v n and hence also measurable. 

Hence Y\ v B is measurable, as the concatenation of measurable maps (here we use the 
joint continuity of RDE solutions in the control and the rough path, i.e. continuity of the mapping 
(5.1)). 

Now, to get the equality (5.2): we can argue as in [13] using the Riemann sum representation of 
stochastic integral. 

(iv) This follows from Theorem 1 in [9] or Theorem 2 in [5]. □ 

Remark 28. One can also prove "by hand" existence of a solution, using a fixpoint argument, like 
the one used in [15]. This way one arrives at the same regularity demands on the coefficients. Using 
the infinite dimensional setting makes it possible to immediately quote existing results on existence, 
which shortens the proof immensely. We thank Terry Lyons for drawing our attention to this fact. 

In the proof of the previous theorem we needed the following version of Theorem 6.2.1 in [24]. 

Theorem 29. Let V, W, Z be some Banach spaces. Let tensor products be endowed with the pro- 
jective tensor norm. 7 Let a 6 (1/3, 1/2]. and T] £ C°' a (W) and Z £ C l3 ~ Halder ([0, T], V) for some 
f3 > 1 — a. Let f : Z — >• L(V, Z) be Lip 1 , let g : Z — > L(W, Z) be Lip 7 , 7 > p. Then there exists a 
unique solution Y G C 0,a (Z) to the RDE 

dY = f(Y)dZ + g(Y)dr), 

in the sense of Lyons. 8 

Moreover for every R > there exists C = C{R) such that 

Pa-H6l( Y , Y ) < C\\Z - Z\\p-H8lder 

whenever (Z,X) and(Z,X) are two driving paths with \\Z\\p_ H6lder .[ 0tT ], \\Z\\p_ Hti i der .[^ T] , \\X\\ a _ Htilder .[^ T] < 
R. 



See [22] p. 18 for more on the choice of tensor norms which, of course, only matter in an infinite dimensional 
setting. 

8 See e.g. Definition 5.1 in [22]. 
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Proof. Since Z and X have complementary Young regularity (i.e. a + /3 > 1) the joint rough path 
A over (Z,X), where the missing integrals of Z and the cross- integrals of Z and X are defined via 
Young integration. So we have 



A s . t = 1 



Z s ,t \ , I f s Z s . r (g) dZ r f s Z s<r (g) dr\ r 

X *>* ) \ Is 11 s ,r ® dZ r f s Vs,r ® *7r 



Then, by Theorem 6.2.1 in [24], there exists a unique solution to the RDE 

dY = h(Y)d\, 

where h = {f,g). 

We calculate how A depends on Z. For the first level we have of course IIA«-A (1) |L -Holder < 
\\Z — Z\\p-Hoider- For the second level we have, by Young's inequality, 

I f, Z s , r dZ r — J Z s ^ r dZ r \ | J Z s ^ T d \Z r — Z r ] \ J Z s r — Z s ^ r dZ r \ 
sup ; - < sup r 77, h sup ; 

s<t \t - S\ 2a - s <t \t - S| 2 « s <t \t - S| 2 « 

< C\\Z\\p-Hdlder\\Z — Z\ \j3-H6lder, 

and similarily 

| J s X s ,rdZ r - J s X s ,rdZ r \ =.. 

SU P u i?Z — Z ~ Z \\I3-Hdlder;\s,t]- 

s<t \t-S\ za 

Plugging this into the continuity estimate of Theorem 6.2.1 in [24] we get 

p a -^y)<c\\z-z\\ p _ mider 

as desired. □ 
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